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PREFACE 

The  research  reported  in  this  Memorandum  was  begun 
under  sponsorship  of  the  National  Aeronautics  and  Space 
Administration.  A  considerable  portion  of  the  work  was 
conducted  at  the  Langley  Research  Center,  Langley  Air 
Force  Base,  Hampton,  Virginia,  where  Mr.  Elliott  is  a 
NASA  aerospace  engineer.  Mr.  Dreyfus,  of  The  RAND  Corpor¬ 
ation,  participated  in  the  project  while  on  a  leave  of 
absence  from  RAND.  The  project  was  completed  and  the 
manuscript  prepared  following  Mr.  Dreyfus'  return  to  RAND, 
his  share  of  that  work  being  sponsored  by  U.S.  Air  Force 
Project  RAND. 

The  linear  feedback  guidance  scheme  developed 
simplifies  the  problem  of  correcting  space  craft  inflight 
disturbances,  which  is  particularly  important  in  space 
vehicle  rendezvous  applications  and  initiating  post-launch 
flight  corrections. 

NASA  has  agreed  to  publication  of  the  work  in  this 

form. 
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SUMMARY 


A  theory  developing  a  linear  feedback  guidance 
scheme  to  correct  for  inflight  disturbances  of  a  vehicle 
during  the  course  of  a  space  mission  is  presented.  The 
theory  is  predicated  on  the  use  of  a  nominal  optimal 
trajectory.  The  scheme  consists  of  a  linear  combination 
of  (1)  perturbations  of  the  vehicle  state  from  its 
nominal  state,  and  (2)  time-varying  gains  to  determine  the 
control  correction  required  to  satisfy  the  constraints  of 
the  trajectory  in  an  optimal  fashion.  Exact  knowledge  of 
the  state  of  the  vehicle  is  assumed. 

An  analysis  of  numerical  results  for  an  idealized 
rocket  trajectory  problem  shows  the  linear  feedback 
guidance  scheme  to  be  effective  over  a  wide  range  of 
chosen  perturbations. 
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I .  INTRODUCTION 


Suppose  that,  for  a  particular  vehicle  and  space 
mission,  an  optimal  trajectory  has  been  determined.  Let 
us  call  this  optimal  trajectory,  associated  with  the 
specified  initial  conditions  and  computed  before  launch, 
the  nominal  trajectory.  Suppose  further  that  due  to 
abnormalities  during  flight  the  vehicle  deviates  from 
this  trajectory.  Tor  the  specified  mission  there  is  now 
a  new  optimal  trajectory  associated  with  the  new  state 
(i.e.,  position,  velocity,  mass,  etc.)  of  the  vehicle, 
and  neither  a  return  to  the  nominal  trajectory  nor 
continued  use  of  the  originally  programmed  control  is 
desirable. 

This  Memorandum  develops  a  simple  linear  feedback 
guidance  scheme  which  determines  the  control  correction 
necessary  to  yield  a  "corrected"  optimal  trajectory  if 
the  vehicle  is  slightly  perturbed  from  its  original 
optimal  trajectory.  Exact  measurement  of  the  state  of 
the  vehicle  at  discrete  time  intervals  is  assumed.  The 
control  correction  rule  is  a  linear  function  of  the  state 
variable  deviations  from  nominal  with  time-varying 
coefficients  that  are  computed  and  stored  before  flight. 

The  technique  of  derivation  is  akin  to  dynamic 
programming  [I],  We  shall  define  an  auxiliary  function 
of  the  state  variables  to  be  called  the  optimal  return 
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function,  characterize  it  by  means  of  the  principle  of 
optimality,  and  then  deduce  the  guidance  scheme  from  the 
resulting  relations.  We  shall  approximate  the  optimal 
return  function  in  the  neighborhood  of  a  given  optimal 
nominal  trajectory  rather  than  compute  the  optimal  return 
function  for  all  state  space.  This  is  accomplished  by 
computing  the  first  and  second  partial  derivatives  of 
the  optimal  return  function  associated  with  the  nominal 
trajectory.  Thus,  at  least  to  first  order,  we  obtain  the 
optimal  return  function  for  any  state  slightly  perturbed 
from  the  nominal  trajectory.  For  large  deviations  from 
nominal ,  the  approximation  is  in  error  and  the  guidance 
scheme  is  inaccurate.  The  meaning  of  "small  deviation" 
depends  on  the  extent  of  nonlinearity  of  any  particular 
problem  and  can  best  be  determined  by  numerical  experi¬ 
mentation. 

The  problem  stated  above  has  also  been  studied  by 
Breakwell  and  Bryson  [2]  and  by  Kelley  [3],  Their 
mathematical  approaches  differ  considerably  from  ours, 
and  the  computation  of  their  linear  guidance  rule  involves 
quite  different  numerical  operations.  Presumably,  however, 
the  end  results  are  mathematically  equivalent.  Questions 
of  comparative  numerical  efficiency  and  accuracy  of  the 
three  methods  remain  to  be  investigated. 
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II .  THE  PROBLEM 

The  optimal  trajectory  problem  we  shall  consider 
has  the  following  discrete-time  statement.  Find  that 
sequence  of  controls,  » 

{u(t)},  t  =  tQ,  tQ  +  A,  tQ  +  2A,  -  -  -  (2-1) 

such  that  the  dynamical  system  governed  by  the  difference 
equations 


x^t  +  A)  =  x^t)  +  ^(^(t),  ....  (2-2) 

xN-i(t),  t  ,u(t)j  A ,  i  -  1 . N  -  1 

xt(t  )  -  x^  ,  i  =  1,  .  .  . ,  N  -  1  (2-3) 

o 

evolves  in  such  a  way  that  the  function 

cp^c^(t)  ,  .  •  • ,  (2-4) 

is  minimized  at  some  unspecified  time  t  determined  by 
the  satisfaction  of  the  m  terminal  conditions 

*j  fa(t)  ,  .  .  xN_1(t),  t j  -  0  (2-5) 

j  •  1,  .  .  . ,  m  <  N. 
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That  is,  we  want  to  determine  a  control  function  u(t) 
that  minimizes  an  objective  function  cp^,  ...»  x^^t 
at  a  terminal  time  determined  by  the  simultaneous  satis¬ 
faction  of  m  terminal  conditions.  For  the  sake  of 
simplicity  of  notation  we  define  and  use 


We  state  the  problem  discretely  since  it  must  be 
solved  digitally  and  used  mechanically  in  a  discrete  way. 
Analogous  continuous  results  are  easily  obtained  by 
letting  A  approach  zero. 
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III.  THE  GUIDANCE  EQUATIONS 

We  assume  that  associated  with  any  set  of  feasible 

initial  conditions  x^  ,  i  ■  1,  .  .  . ,  N,  there  exists 

o 

an  optimal  trajectory  to  the  desired  terminal  state  and 
an  associated  minimal  value  of  cp  at  the  end  point.  Let 
us  define  the  optimal  return  function  s|x^  ,  .  .  . ,  x^  j 
by 


the  value  of  cp  at  the  terminal 

time  determined  by  iK  =  0, 

where  the  vehicle  starts  in  state 

x^  ,  .  .  . ,  Xjj  and  an  optimal 
o  o 

trajectory  is  followed. 


The  function 


satisfies  the  relation 
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Equation  (3-1)  is  a  statement  of  the  fact  that  the 
minimal  terminal  value  of  cp  (i.e.,  optimal  return) 
associated  with  a  given  state  at  time  tQ  equals  the 
optimal  return  associated  with  the  state  at  time  (t0  +  A) 
yielded  by  the  control  action  u  at  time  tQ  where 
u(tQ)  has  been  so  chosen  as  to  yield  that  state  at  time 
(tQ  +  A)  with  minimal  return.  This  reasoning  is  called, 
by  Bellman,  the  principle  of  optimality. 

The  above  characterization  of  S  implies  two 
equations  .  The  first  of  these  is  a  necessary  condition 
for  optimality;  i.e.,  that  the  right-hand  side  of  (3-1) 
be  a  minimum  with  respect  to  all  admissible  controls  uQ. 
If  u  is  unbounded,  we  minimize  by  differentiation  and 
obtain 


In  equation  (3-2)  we  have  used  the  chain  rule  for  the 
derivative  of  a  function  whose  arguments  are,  in  turn, 
functions.  The  second  equation  is  a  statement  of  the 
fact  that,  on  the  optimal  trajectory  from  any  initial 
state,  the  value  S  is  invariant,  regardless  of  the 
particular  initial  state.  That  is 

S(*V  •  *  •»  xN0j  "  S(^l0  +  fl0A»  •  •  *’  *N0  +  fNQA) 
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where  the  are  evaluated  using  the  optimal  uQ. 

o 

Note  that  the  optimal  uQ,  uQ*,  defined  by  equation  (3-2) 

depends  on  the  point  in  state  variable  space  since  both 
d  s 

■5—^  and  f.  depend  on  the  state. 
lo 

We  now  wish  to  consider  the  nominal  trajectory, 
which  is  the  optimal  trajectory  from  a  particular  specified 
initial  state.  If  we  think  of  time  as  indexing  each 
point  in  state  space  along  this  trajectory,  S  has  the 
same  contant  value  at  each  time. 


etc. , 


have  numerical  values  which  depend  on  the  time,  and  the 
combination  of  the  derivatives  defined  in  (3-2)  is  zero. 

For  convenience  and  to  simplify  notation  we  use 
double  subscript  notation  to  denote  summation.  Also 
since  we  will  be  concerned  with  adjacent  states  in  state 
variable  space  (i.e.,  proceeding  discretely  in  time)  we 
drop  the  subscript  o  and  use  the  following  notation  to 
denote  evaluation  of  the  partials  at  the  appropriate  time: 


3S(x-^  +  .  .  . ,  Xjj  +  fjjA) 

a(x3  +  fjA) 


N 


t+A 
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Rewriting  equations  (3-2)  and  (3-3)  in  this  notation  we 
have 


as  a^ 

TO"| 

t 


+A  |t 
S 


t-r d 


(3-4) 


(3-5) 


Our  object  is  to  determine 

J 


the  rate  of 


change  in  optimal  control  u  associated  with  a  per¬ 
turbation  in  the  nominal  value  of  at  time  t.  Since 

equation  (3-4)  determines  the  optimal  control,  we  take  its 

partial  derivative  with  respect  to  Xj  keeping  in  mind 

* 

that  the  optimal  control  decision  u  may  change  due  to 
this  perturbation  in  x^ .  This  yields 


3f, 


a2s 

dx^dXj 


au 


afi 

a^s 

a 

s£k 

r 

t+A 

t 

t+A 

+  Jxj 

t  t 


du 


A  + 


as 

ix. 


t+A 


auax1 


a2f. 


au“ 


(3-6) 


au 

dx 


j 


-  0 


which  gives  in  terms  of  the  first  and  second  partial 

derivatives  of  S  evaluated  on  the  nominal  trajectory. 
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9  c 

To  determine  a  recurrence  relation  for  on  the 

j 

nominal  trajectory  we  take  the  partial  derivative  of  (3-5) 


with  respect  to  x^ .  This  yields 


AS 

AS 

AS 

&fi 

+  *h 

t 

t+A 

t+A 

(3-7) 


where  we  have  used  the  fact  that 

AS 


*5 


afi 

"Su" 


t+A 


is  zero  (see  equation  (3-4)). 

To  obtain  a  recurrence  relation  for 


a2s 


sirrsq;  011  the 

nominal  trajectory  we  differentiate  (3-7)  with  respect  to 
x. ,  obtaining 


A2S 

a2s 

a2s 

J- 

&fk 

Axj  Ax^ 

AXj  Ax^ 
t 

Sxj4xk 

t+A 

t+A 

Af,. 


)u 


Au 


a2s 


A  +  3x^8x^ 


t+A 


afi 


a2s 


Ax.  A 


i0xk 


• 

&fk 

4£k 

Au* 

>{L 

+  *u" 

t+A 

t 

t 

t 

_ 

(3-8) 
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iXj  dx^ 


+  7u TST  *xj  A 


Given  the  first  and  second  partial  derivatives  of 

S  at  time  t  +  A  along  the  nominal  trajectory,  equation 

d  * 

(3-6)  allows  us  to  compute  r—  at  time  t.  Equations 

j 

(3-7)  and  (3-8)  yield  the  first  and  second  partial  deri¬ 
vatives  of  S  at  time  t  thus  allowing  the  continuation 
of  the  backward  recursion. 

- if 

Once  we  know  ,  the  change  in  optimal  control 

j 

dictated  by  a  change  in  state,  as  a  function  of  time 
along  a  nominal  trajectory,  the  linear  rule 


11  ★ 

6u*  (t)  -  Y,  l^7<t)6xj(t) 


(3-9) 


yields  the  change  in  u  required  to  correct  for  devia¬ 
tions  from  nominal,  6xj ,  in  the  state  variables. 

This  is  the  linear  feedback  guidance  scheme.  The 
time-varying  coefficient  functions  in  (3-9),  •*^-(t)  , 

j 

are  computed  along  the  nominal  trajectory  before  flight 
by  recursion  of  equations  (3-6),  (3-7),  and  (3-8),  and 
stored  in  the  guidance  computer.  During  flight,  at  time 
tQ  +  kA  when  the  states  are  observed,  the  state  devia¬ 
tions  from  nominal  are  multiplied  by  the  proper  stored 
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coefficients  and  summed,  as  indicated  by  equation  (3-9)  , 

ie 

to  obtain  the  proper  adjustment,  6u  (t) ,  in  control. 


In  the  next  section  we  derive  the  terminal  values  of 
32S 

and  d -  3'x  that  allow  the  backward  recursion  of 
i  k 


equations  (3-6),  (3-7),  and  (3-8). 
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IV.  BOUNDARY  VALUES 

The  proposed  guidance  scheme  involves  the  computation 
of  the  second  partial  derivatives  of  S  backward  from  the 
terminal  time  along  a  nominal  (optimal)  trajectory  as  well 
as  the  necessary  partial  derivatives  of  the  control.  The 
terminal  values  can  be  deduced  by  at  least  two  methods. 
Method  I  yields  some  physical  insight  into  the  nature  of 
the  limiting  process  involved;  however,  it  is  cumbersome 
and  impractical  for  complicated  problems.  Method  II 
appears  to  be  more  practical  and  is  recommended  for  general 
usage. 

Method  I- -A  set  of  terminal  equations 

,  Xjj)  “0  J  =  1 ,  •  •  .  ,  m  (^— 1) 

is  satisfied  at  the  final  time  T  by  the  nominal 
trajectory.  Let  a  change  from  nominal  in  any  state  at  a 
time  t  shortly  before  T  occur.  The  control  must  then 
be  adjusted  so  that  the  m  constraining  equations  remain 
satisfied  at  the,  perhaps  changed,  terminal  time. 

Let  us  evaluate  Yj  at  terminal  time  T  by  means 
of  a  Taylor  series  expansion  about  time  t  shortly  before 
T.  Thus 


2 

Yj(T)  -  Yj(t)  +  (T  -  t)fj(t)  +  A  j  ^  +  •  •  •  •  (4-2) 
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The  fact  that  Y^ (T) ,  where  T  is  the  terminal 
time  which  may  change  if  we  change  state  at  time  t, 
must  equal  0  at  the  terminal  time,  yields  the  m 
equations 


SY.(T)  3Y  (t)  9T 

SxfrtX  =  0  '  -S^TtT  +^TtT  ’j(t) 


(4-3) 


+  (T  - 


t) 


a^Ct) 

dx|(.t) 


9Y  9u*(t) 
+  9u(t)  3xi(t) 


J  =  1 ,  •  •  «,m 


corresponding  to  state  x^. 

Since  depends  on  the  control  u  ,  Y^  upon  u 

•  "ic  ~ 

and  u  ,  etc.,  the  above  m  linear  equations  can  be 
solved  for  the  m  quantities 


ST  Su*(t)  3u*(t)^m_1^ 

axt(t)  ’  dxi(t) ’  *  *  •»  axt(t) 


(4-4) 


in  terms  of  the  nominal  state  variables  and  control. 

Now,  writing  the  objective  function  cp,  similarly, 
as 


•  > 


cp(T)  -  cp(t)  +  (T  -  t)cp(t)  +  .  . 


(4-5) 
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we  have 


+  bT 

■SxJTtT 


<P(t) 


+  (T  -  t) 


(4-6) 


Since  the  left  side  of  equation  (4-6)  is  by  definition 


t 


we  now  have  the  values  of  the  first  partial  derivatives  of 
S  along  a  nominal-  trajectory  at  a  time  t  just  before 
the  terminal  time  T.  As  t  -*  T 


Su(t)  Su(t) 

■SxJTtT’ 


etc.,  in  expression  (4-4)  approach  infinity,  so  we  begin 
the  backward  recursion  a  small  time  A  before  the  terminal 
time.  Then  these  quantities  are  large  but  finite. 

To  obtain  the  second  partial  derivatives  of  S  we  take 
the  partial  derivative  of  (4-6)  after  substituting  for 

ST  Su*(t) 

Sxi( t)  *  T^ttT  » 
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etc . ,  their  known  equivalents  in  terms  of  state  and  con¬ 
trol  variables.  This  gives  expressions  for  the  second  par¬ 
tial  derivatives  of  S  near  the  endpoint  and  equations  (3-6) , 
(3-7),  and  (3-8)  can  now  be  solved  by  backward  recursion. 

As  will  be  seen  in  subsequent  sections  where  the 
above  scheme  is  illustrated,  the  algebra  involved  in  this 
method  of  evaluating  the  terminal  boundary  values  is  tedious. 

Method  II--This  method  is  an  adaptation  of  the  work 
of  Breakwell  and  Bryson  [2],  It  uses  a  portion  of  their 
scheme  for  optimal  linear  feedback  guidance  to  determine 
the  necessary  partial  derivatives  of  the  control  with  re¬ 
spect  to  the  state  variables  and  those  terminal  second 
partial  derivatives  necessary  in  our  approach. 

The  following  results  hold  at  the  terminal  point  of 
a  nominal  (optimal)  trajectory. 


**1 


»  •  •  • 

»xN)  0  j  -  1, .  .  .  ,m 

(4-7) 

dco  , 

Wl  + 

ay 

vj  ^  i  -  i,...,n 

(4-8) 

as 

■577 

f.  -0. 

(4-9) 

The  Vj  above  are  m  auxiliary  numbers  produced  during 
the  computation  process  of  determining  the  nominal  tra¬ 
jectory  [4].  On  a  neighboring  optimal  trajectory  (which 
is  within  some  region  where  the  assumption  of  linearity 
is  valid)  that  satisfies  the  constraints  of  the  problem, 
the  following  N  +  m  +  1  equations  hold.  (Time  is  assumed 
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to  be  the  independent  variable  in  the  following.) 

SY. 

+  Vj  dtj  -  0  j  ■  1,  .  .  m  (4-10) 

as  d  /as\  a2cp  a2Y 

6  9x^  +  ‘3t\3x^y  dtf  "  dxidxfc  6Xk  +  VJ  dxid*k  6X^  (4-11) 

9T.  d  /3cp  SyA 

+  dvj  +  ^tfeq  +  vj  ssy  dtf  1  "  1 »  •  •  • »  N 
as  as  af. 

8  1*1  h  +  TTt  8*k  '  0  •  <4’12> 


The  evaluation  of  the  terms  in  the  above  three  equa¬ 
tions  is  made  at  the  final  time  on  the  nominal  trajectory. 
It  should  be  noted  that  use  of  the  fact  that 


as  af. 


"5x7  du 


has  been  made  in  equation  (4-12).  The  dv,  and  M-jr^  ) 

as  j  v *1/ 

are  the  changes  in  and  -g^-,  at  the  terminal  time  t f 
of  the  nominal  trajectory,  required  to  satisfy  the  con¬ 
straints  of  the  problem  in  the  additional  time  dtf  due 
to  the  perturbations  6x^  at  nominal  terminal  time  tf. 
Thus  we  have  N  +  m  +  1  equations  in  2N  +  m  +  1  unknowns, 
Hence,  there  are  N  independent  variables  and  the 
remaining  variables  are  considered  dependent.  The 
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equations  relating  the  independent  variables  to  the 
dependent  variables  are  determined  at  the  terminal  time, 
tj,  of  the  nominal  trajectory  and  used  as  boundary 
conditions  for  the  backward  integration  of  the  linearized 
equations  applicable  to  an  optimal  trajectory;  that  is, 


the  differential  equations  of  motion  and  the  differential 

equations  for  the  multipliers  (here  .  Also  in 

xi 

order  to  solve  for  6u,  the  necessary  condition  for 


optimality  is  linearized.  These  equations  are 


d  &f.  df . 

^t<ftxi>  “  6xj  +  'Sxr  6u  (4"13) 

j  J 


(4-14) 


(4-15) 


The  linearized  equations  are  solved  backward  in  time  N 
times,  where  each  time  all  but  one  of  the  variables  at  the 
terminal  time  are  assumed  zero.  The  value  of  one  is 
assigned  to  the  remaining  term  in  turn  until  all  N  terms 
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have  been  used  in  this  manner.  This  will  be  clearer 
after  examination  of  the  example  problem.  Next  we  solve 
for  (6  at  time  t  in  terms  of  the  6x^  at  time 

t.  Then  holding  all  6x^  except  6Xj  zero,  we  solve 


at  time  t. 


This  is  the  desired 


X^OXj 


In  similar  fashion  we  solve 


equation  (4-15)  for  6u  I  in  terms  of  6x^  and 

It  t 

interpret  as  We  then  use  these  values  as  the 

boundary  values  to  initiate  recurrence  equations  (3-6)  , 
(3-7),  and  (3-8). 
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V.  A  SIMPLE  EXAMPLE 


To  Illustrate  the  above  concepts,  let  us  study  a 
simple  problem  with  a  known  solution.  We  wish  to  program 
the  direction  v(t)  so  that  a  particle  moving  in  a  plane 
at  a  constant  velocity  V  reaches  a  fixed  point  (x^,  y^) 
in  minimum  time.  Here,  in  our  previous  notation, 

x(t  +  A)  ■  x(t)  +  (V  cos  y(t))A  (5-1) 

y(t  +  A)  -  y(t)  +  (V  sin  V(t))A 
co(x,y,t)  -  t 
^(x.y.t)  -  x  -  xx  -  0 
Y2Cx,y ,t)  -  y  -  y1  -  0 

where  cp  is  the  objective  function  to  be  minimized  and 
and  ?2  are  terminal  conditions. 

Starting  at  the  point  (xQ,  yQ)  we  determine,  in 
this  case  by  inspection,  that  the  minimum  time  trajectory 
is  a  straight  line  between  (xQ,  yQ)  and  (x^,  y^). 

The  optimal  time  of  arrival  at  (x^,  y^)  starting 
at  any  initial  point  (xQ,  yQ)  at  time  tQ  is  then 

\/<x i  -  x0>2  +  ■  y0)2 

j - — y - - 


Sfrotfo^o*  “  fco  + 


(5-2) 
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To  deduce  the  appropriate  boundary  conditions  by 
Method  I,  we  let 

^(T)  -  x(T)  -  x1  -  x(t)  -  Xj_  +  (T  -  t)V  cos  Y  (5-3) 
+  v  sin  YY)  +  .  .  . 

VT>  -  y(T)  -  yx  -  y(t)  -  yx  +  (T  -  t)V  sin  Y 
+  (T  2  ^2(V  cos  yy)  +  .  .  .  . 

Then 


9Y,  (T)  ST  3y 

-S^tT  ■  0  -  1  +  V  cos  y  +  (T  -  t)<-  V  sin  y)  (5-4) 

st,(t)  ST  ay 

S^tT  .  0  -  55  V  sin  y  +  (T  -  t)  V  cos  Y  ^  ,  (5-5) 


neglecting  higher  order  terms. 
Hence , 


8T  _  cos  Y .  5Y  _  sin  Y 

- 7“’  V(T  -  t)  • 


(5-6) 
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Con  tinuing  ,  we  derive  a  typical  terminal  value  of  a 
second  partial  derivative  of  S.  Partial  differentiation 
of  (5-7)  with  respect  to  y  yields 

sin  y  sin  Y  cos  Y 

3x8y  V  VZ(T  -  t) 

Again,  since  the  exact  form  of  S  is  known  in  this 
example,  this  result  can  be  verified: 


»2s  _  _  <xi  -  ^0>(yi  -  yQ> 

ax03y0  ’  "  V((Xl  -  x0)Z  +  (Yl  -  y0)z)3,z 


(5-10) 


_ 22 _ 

V(T  -  t)V 


3 


sin  y  cos  Y 
V2(T  -  t) 


For  the  purpose  of  illustrating  the  use  of  Method  II, 
we  observe  as  pointed  out  in  Chap.  V  of  Bellman  and 

*  Q 

Dreyfus  [1],  that  r— “  *-s  equivalent  to  X  ,  the 

0  x^ 

familiar  multiplier  functions  of  the  classical  calculus 
of  variations.  We  also  replace  the  difference  equations 
(5-1)  by  differential  equations  and  include  time  as  a 
state  of  the  system.  Our  system  becomes 


•  • 
x  ■  V  cos  Y  Xx  -  0 

y  ■  V  sin  Y  X^  ■  0 

t  • 


(5-11) 


t  -  1 


0 
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The  optimal  policy  to  be  followed, 

-  XV  sin  Y  +  XV  cos  Y  -  0 
x  y 


* 

Y  ,  is  given  by 
+  X 

•  tan  Y  ■  . 


x 


(5-12) 


From  the  terminal  conditions  of  the  optimal  nominal 
trajectory  we  have  (where  superscript  f  denotes  final 
value) 

Xxf  *  V  Xyf  “  v2;  "  1  *  (5-13) 

Since  the  time  rate  of  change  of  the  multipliers  is  zero 
we  have 


Vl*  “  V2  alU*  ^  * 


(5-14) 


From  the  transversal ity  condition  at  the  terminal  point 

and  the  optimal  policy  condition 

f  ..  sin  Y  _ 

Xy - 7 - v2 


(5-15) 


cos  Y 

V 


1  * 


Then  applying  Method  II  to  determine  conditions  on  a 
neighboring  optimal  trajectory,  equations  (4-10),  (4-11), 
and  (4-12)  become 

&x  +  V  cos  Y  dt  ■  0  (5-16) 

fiy  +  V  sin  Y  dt  *  0 
6Xx  -  dvt  -  0 
6Xy  -  dv2  -  0 

flxt  -  0 

6X  V  cos  Y  +  4X  V  sin  y  +  b\  m  0  . 
x  y  c 
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Since  N  ■  3  and  m  -  2,  we  have  6  -  N  +  m  +  1 
equations  in  nine  unknowns  (2N  +  m  +  1) .  (The  unknown 
it  does  not  appear  since  its  coefficients  are  always 
zero.)  We  wish  to  choose  three  of  the  unknowns  as 
independent  such  that  the  coefficients  of  the  remaining 
six  when  written  in  matrix  notation  form  a  non-singular 
matrix.  That  is,  we  wish  to  form  the  matrix  equation 

AX  -  -  F 

and  solve  for  X 

X  -  -A-1F  . 

In  the  above,  we  choose  it,  iX^,  an<*  ^t  as 
independent.  Choosing  a  convenient  order  we  have, 
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and 


A 


-1 


V  cos  y 


1 


V  cos  Y 


0 

0 

0 

0 

0 


0  0  0  0 

V  cos  Y  0  0  0 

0  -V  cos  y  0  -1 

0  0  -V  cos  Y  0 

0  0  0  V  cos  Y 

0  0  0  -1 


Then  writing  the  state  variables  and  the  multipliers  in 
terms  of  the  independent  variables  we  have 


’fix 

0  -V  cos  Y  0 

6y 

m 

0  -V  sin  Y  0 

dt 

(5-19) 

fit 

0  0  1 

■  « 

fit  _ 

K 

-tan  Y 

0 

O' 

6X 

y 

- 

1 

0 

0 

dt 

6xt 

tf 

0 

0 

0. 

Cf 

.fit  . 

Now  the  linearized  differential  equations  (equations  (4-13) 
and  (4-14))  become 


-18) 
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(6x)  -  V2  sin  V  [  6Xy  COS  Y  -  6Xx  sin  y]  (5-20) 

It  w  -  -  V2  cos  Y  £  6Xy  cos  Y  -  6Xx  sin  y] 

It  <5t>  - 0 


It  <SXx>  "  It  (‘V  ■  It  (‘V  *  0 


where 

(6X  cos  Y  -  6X  sin  Y) 

6y  -  - 2 - ^ -  (5-21) 

X  cos  Y  +  X  sin  Y 
x  y 


has  been  simplified  and  substituted  into  equations  (5-20). 
Setting  each  of  the  independent  variables,  in  turn,  equal 
to  1  while  the  other  two  are  set  equal  to  0  and  integrating 
the  linearized  differential  equations  backward  in  time 
with  initial  conditions  determined  from  the  matrix 
equations  (5-19),  we  have 


6X 

X 

-tan  Y 

0 

0 

1  >7 

*o 

6X 

y 

mi 

1 

0 

0 

dt 

*xt 

t- 

0 

0 

0 

J 

tf 

6t  _ 

(5-22) 


6x 

-V2tan  Y  (t^-t) 

-V  cos  Y 

6y 

- 

v2(tf  -  t) 

-V  sin  Y 

6t 

• 

t 

0 

0 

(5-23) 
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Solving  equation  (5-22)  for  fiX^,  dt,  and  fit  at  time  t^ 
in  terms  of  the  state  variables  at  time  t  and  inserting 
in  equation  (5-23),  we  have 


6X  1 

X 

V  tan  Y  sin  Y  -V  sin  Y  0 

"fix* 

6X 

cos  Y 

-  ■■  —  ~ - 

-V  sin  Y  V  cos  Y  0 

&y 

y 

v3(tf-t) 

fix. 

JL 

0  0  0 

6 1 

t 

t 

cf 

(5- 


6X 


Now  we  let  6y  -  fit  *  0  and  interpret  —  m  — £,  etc. 

X  dx* 

Thus  we  have 


S2S  _  sin2Y 
ix1  V2(tf  -  t) 

2  2 
fi  S  _  CQS*Y 

ay2  v2(tf  -  t) 

fi2S  m  sin  Y  cos  Y 

‘  v2(tf  -  o  * 


(5-25) 


Solving  for  fiY  (equation  (5-21))  in  terms  of  fix  and  fiy 
at  t  in  a  similar  manner,  we  have 


&Y 

*x 


sin  Y 

V(tf  -  t) 


bY  _  cos  Y 

3y  *  VTtf  -  t)  * 


24) 


(5-26) 
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VI.  AN  IDEALIZED  PRACTICAL  PROBLEM 


To  further  illustrate  and  clarify  the  concepts 
developed  here,  an  idealized  practical  problem  has  been 
solved.  The  problem  is  to  determine  the  thrust  attitude 
program  of  a  rocket  vehicle  which  will  yield  the  maximum 
attainable  horizontal  velocity  at  specified  terminal 
conditions  of  time,  altitude,  and  vertical  velocity. 

The  following  idealizing  assumptions  are  made: 

1)  Vacuous  atmosphere 

2)  Flat  earth 

3)  Constant  gravitational  acceleration 

4)  Constant  thrust  acceleration. 

The  differential  equations  of  the  system  are 


h  •  v 

v  ■  a  sin  8  -  g 
u  ■  a  cos  0 
t  -  1 


(6-1) 

-V  +  v 

o 


where 

h  ■  altitude 

v  “  vertical  velocity 

u  ■  horizontal  velocity 

a  ■  thrust  acceleration  (constant) 

g  ■  gravitational  acceleration  (constant) 

6  ■  angle  of  thrust  acceleration  above  the  horizontal 
t  *  time. 
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It  is  well  known  that  one  can  determine  the  form  of 
the  optimal  policy  for  this  problem  from  the  necessary 
condition  for  optimality;  that  is 

X 

tan  6*  -  ^  -  -X.  t  +  X  .  (6-2) 

u  n  vo 

The  quantity  to  be  maximized,  cp ,  and  the  constraints, 
of  the  problem  are 

cp  -  u  (6-3) 

♦  l-  h-hf-0 

t2"v-vf«0 
*3  -  t  -  tf  -  °  . 

For  given  initial  conditions,  an  optimal  trajectory  can 
be  determined  which  will  satisfy  the  desired  terminal 
conditions .  Our  problem  then  is  to  determine  the  optimal 
control  action  necessary  to  correct  for  small  disturbances 
in  this  nominal  trajectory.  That  is,  we  wish  to  determine 
the  coefficients  of  &x  in  equation  (3-9).  For  this,  we 
need  to  determine  the  recurrence  relations,  equations 
(3-6),  (3-7),  and  (3-8),  and  the  boundary  conditions 
required  to  initiate  the  recursion. 
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VII.  RECURRENCE  RELATIONS  AND  BOUNDARY  VALUES 

To  derive  recurrence  relations  for  this  problem,  we 
define  S  by 

S  ■  optimal  cp  final  ■  optimal  u(t^)  .  (7-1) 

We  note  that  it  is  unnecessary  to  obtain  the 
d  6 

coefficient  -y^  in  our  guidance  rule  since  at  time  t^  a 
perturbation  of  time  of  6t  from  nominal  can  be  regarded 
as  a  deviation  in  position  and  velocity  from  the  nominal 
states  at  time  t^  +  At.  That  is,  any  one  state  can  be 
used  to  index  points  on  the  nominal  and  errors  can  then 
occur  only  in  the  other  states .  For  rendezvous  problems , 
time  is  not  always  the  best  indexing  variable.  Rewriting 
equations  (6-1)  as  difference  equations,  we  determine 
the  recurrence  equations  for  -y^,  -y^,  and  -y^j  to  be 


<>2s 

8h^v 


a  cos  6 1  +  k 


de 


t+A 


-  0 


(7-2) 


d2S 


dv 


a  cos  6 


t+A 


+  A 


&2S 


50, 


a  cos  6 


■Su 


t+A 

-  0 


+  k 


501 


where 
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k  -  A 


a2s 

dv2 


2 

a  cos  6 

as 

"  "5u 

a  cos  3 

t+A 

t 

t+A 

(7-3) 


as 

"3v 


a  sin  6 


t+A 


Since  k 


de 


0  we  must  have 


ae 

■jn 


o  . 


(7-4) 


Therefore  we  need  only  concern  ourselves  with  -jp  and 
deviations  in  horizontal  velocity  being  irrelevant  to 
the  control  action,  as  can  easily  be  seen  physically. 
The  necessary  recurrence  relations  are 


as 

TR 


t 


as 

■5E 


t+A 


(7-5) 


as 

"3u 


t 


1 


as, 

-  as. 

.  *  as, 

7^ 

|t 

t+A  | 

a2Si  .  a2s 


ah 


t+A 


t+A 


a  A  ae 

a  cos  0  A 


t 
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a2s,  _  a2s 


1  +  a  cos  9  A 


ae 


t+A 


+  A 


d2S 


dll 


t+A 


a2s 


3v 


m 

'  +  t  a2s  ] 

[ 1  +  a  cos  6  A 

la^2 

t 

\  / 

t+A  '  Vj 

i  d2s  ,  A  a2s 

+  M 


ae  ae 


t+A 


The  boundary  conditions  on  ,  and  the  necessary 

second  partials  of  S  have  been  determined  by  Method  I, 
and  verified  by  Method  II.  However,  the  use  of  Method  II 
only  is  illustrated  in  the  following.  Here  we  again  use 
the  notation  that 


as 


X„  . 


’xi  xi 


From  the  terminal  conditions  of  the  problem  we  have 

Xhf  "  V  Xvf  “  v2  *  Xuf  "  l>  Xtf  •  v3  <7"6) 

which,  by  virtue  of  the  differential  equations  of  the 
multipliers,  implies 


xh(t)  “  vi>  xv(t)  "  tan  0(t)»  xu(t)  ■  1, 

X^(t)  “  • 


(7-7) 
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Application  of  equations  (4-10),  (4-11),  and  (4-12)  results 
in 

6h  +  v  dt  -  0  (7-8) 

6v  +  (a  sin  9  -  g)dt  ■  0 

6Xh  "  dvi  *  0 

6X  -  X,  dt  -  dv_  -  0 
v  n  Z 

6X  -  0 
u 

6Xfc  -  dv3  -  0 

6X^v  +  6X^(a  sin  9  -  g)  +  6X^  3  cos  6  +  6X^  +  X^6v  *  0. 


Choosing  6X^,  ^u>  and  ^  as  independent  and  operating 

as  before,  we  have 


where  B  -  (a  sin  9  -  g) . 

Writing  the  multipliers  and  the  state  variables  at 
time  t£  in  terms  of  the  independent  variables,  we  have 
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"6xh 

1 

0 

0 

0  ' 

wh 

6X 

V 

0 

1 

0 

0 

6X 

V 

S 

0 

0 

0 

0 

6u 

.sxt 

1 

-V 

-B 

0 

-BXh. 

_6t_ 

(7-10) 


6h 

0 

0 

0 

V 

"6Xh 

6v 

0 

0 

0 

B 

6X 

V 

6u 

0 

0 

1 

0 

6u 

_6t 

fcf 

JO 

0 

0 

1 

Cf 

6t 

(7-11) 


The  linearized  differential  equations  are 

3t<‘h>  -  6v  (7-12) 

■j^(6v)  ■  a  cos^  0(6Xv  cos  0  -  6Xy  sin  0) 

^(6u)  ■  -a  sin  0  cos  0(6Xv  cos  0  -  6Xu  sin  0) 

3t<6t>  “  0 

lt(6V  ■  ■Jt(6Xu)  ’  lt(6Xt)  "  0 

Zt(6XV)  “  "6Xh 


where 

60  ■  cos  0  (6Xv  cos  6  -  6Xu  sin  0)  . 


(7-13) 
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Letting  fiX^,  6Xv,  6u,  and  6t  equal  1,  in  turn,  while 
equating  the  others  to  0  and  integrating  equations  (7-12) 
backward  in  time  with  initial  conditions  from  equations 
(7-10)  and  (7-11)  we  have 


~6xh 

1 

0 

0 

o" 

'6Xh 

6X 

V 

t 

1 

0 

0 

6X 

V 

6Xu 

m 

0 

0 

0 

0 

6u 

fXtJ 

t 

-V 

-B 

0 

-BXh 

tr 

6t_ 

(7-14) 


6h 

a_ 

cosJe  t 

3  -2 

a  cosJ0  c 

0 

v-Bt 

6 

2 

6v 

a_ 

3  -2 

cosJe  tr 

- 2 - 

3. 

-a  cos  t 

0 

B 

6u 

m 

a  sin 

2  -2 

9  cos  o  tZ 

— 2 - 

a  sin  0  co8^0  t 

1 

0 

6 1_ 

0 

0 

0 

fiXj 

SX 

\ 

6u 

6t 


(7-15) 


where  t  ■  -  t. 

Solving  equations  (7-14)  and  (7-15)  for  the  delta 
multipliers  at  time  t  in  terms  of  the  delta  state  var¬ 
iables  at  time  t,  we  have 
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Evaluating  60  in  equation  (7-13)  in  a  similar  manner  and 

i  4.1.4  69  69  j  69  69  £.  , 

ietting  '5K  and  we  find 


69  , 

6 

“SK  " 

1 1 

a  cos  0  (t^  - 

"o7 

(7-18) 

60, 

I 

4 

&V 

It 

a  cos  0  (tf  - 

T> 
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The  remaining  boundary  values  necessary  to  initiate  the 
recursion  are 

(7-19) 

(predetermined  as  a  part 
of  the  nominal  optimal 
trajectory) 


This  analysis  of  the  boundary  values  has  been  more  general 
than  necessary.  However,  it  should  serve  to  show  the 
utility  of  the  method. 


as. 

■5K 


as 

"Sv 


tan  0*(t) 


VIII .  RESULTS 


The  performance  of  the  linear  feedback  guidance  scheme 
was  evaluated  by  comparing  solutions  using  the  scheme  to 
correct  for  initial  condition  perturbations  about  a  nominal 
(optimal)  trajectory  with  true  optimal  trajectory  solutions 
for  the  same  perturbed  initial  conditions.  The  optimal 
trajectories  were  computed  using  the  method  of  steepest 
ascent  [4] . 

The  nominal  optimal  trajectory  solved  the  problem 

with  the  following  constants  and  terminal  conditions 

2 

a  «  2g  ft/sec 
g  -  32  ft/sec^ 
hf  -  100,000  ft 
V£  «  0  ft/sec 
tj  *  100  sec . 

The  initial  conditions  of  the  nominal  trajectory  at  time 

t  *  0  were 

h  -  0  ft 
o 

v  *  0  ft/sec 
o 

u  ■  0  ft/sec. 
o 

The  accuracy  of  the  computer  program  was  verified 
by  comparing  nominal  trajectory  results  from  the  computer 
program  with  the  desired  terminal  conditions  and  the 
analytical  solution  for  the  optimal  horizontal  velocity, 
u.  The  accuracy  was  considered  excellent. 
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Perturbations  in  the  nominal  trajectory  were  assumed 
by  varying  the  initial  conditions  of  the  problem.  Table  I 
shows  the  various  perturbations  assumed  and  both  the 


Table  I 


Optimal  Solution 

1 

Guidance  Scheme  Solution 

Case 

Uf,  ft/sec 

HQ 

Uf,  ft/sec 

Hominal 

3507.81 

6h(o)  -  1000 

3569.94 

100,000 

0.75 

3570.60 

6h(o)  -  -1000 

3443.43 

100,000 

-0.44 

3442.53 

6h(o)  ■  5000 

3796.14 

100,001 

6.70 

3798.61 

6v(o)  ■  50 

3733.78 

100,000 

3.29 

3736.11 

6v(o)  ■  -50 

3246.40 

— ■ 

-0.46 

3243.94 

6t  ■  3  sec 

2960.43 

100,001 

0.72 

2957.31 

computed  optimal  solution  and  the  guidance  scheme  solution 
based  on  the  above  nominal  trajectory.  (In  all  cases  the 
computed  optimal  solution  satisfied  the  desired  terminal 
constraints  within  0.1  ft  in  altitude  and  0.1  ft/sec  in 
vertical  velocity.)  The  feedback  gains  used  are  shown 
plotted  against  time  in  Fig.  1.  The  last  case  shown, 

6t  -  3  sec,  corresponds  to  the  case  where  97  seconds  of 
flight  time  remain;  i.e.,  launch  is  three  seconds  late. 


Fig.  1--Time  History  of  Feedback  Gains  for 
Idealized  Rocket  Trajectory  Problem 
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This  is  analogous  to  the  case  of  6h  ■  -138  ft  and  6v  ■ 

-92  ft/sec  ai  t  ■  3  sec,  since  the  nominal  trajectory  had 
an  altitude  of  138  *.  and  a  velocity  of  92  ft/sec  at  time 

3.  Figure  2  presents  a  plot  of  66  against  time  for  this 
case.  This  was  the  case  of  most  extreme  control  correction. 

It  is  impossible  to  precisely  evaluate  the  perfor¬ 
mance  of  the  guidance  scheme.  However,  the  following 
observations  may  be  made : 

1)  The  terminal  altitude  in  all  cases  was  highly 
satisfac tory . 

2)  The  terminal  vertical  velocity  was  such  that  it 
would  require  no  more  than  one  quarter  of  one 
second  additional  flight  time  to  drive  the  vertical 
velocity  to  zero.  (This  is  under  the  assumption 
that  a  perturbation  of  6h  *  -5000  would  result 

in  a  negative  v^  of  «6.70  and  that  the  available 
acceleration  was  applied  in  the  vertical  direction.) 

3)  The  final  horizontal  velocity  was  in  all  cases 
within  0.1  of  one  per  cent  of  the  optimal 
horizontal  velocity.  Note  that  in  some  cases 
where  terminal  conditions  are  not  quite  met 
"better  than  optimal"  horizontal  velocities  result. 
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0  20  40  60  80  100 
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Fig.  2 — Time  History  of  Control  Correction 
for  4t  ■  3  sec  Case 
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